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BACKGROUND OF THE INVENTION 
The present invention relates to storage systems, 
and in particular, to a method and apparatus for storing data 
on a virtual tape storage system. 
10 A virtual tape storage system is a hardware and 

software product configured to interact with a host computer. 
Application programs running on the host computer store data 
output on tape volumes for storage. These tape volumes are 
embodied in the virtual tape storage system as virtual volumes 
15 on virtual tape drives (VTD) . A virtual volume is a 

collection of data, organized to appear as a normal tape 
volume, residing in the virtual tape storage system. To the 
host computer and to the application programs, the tape volume 
contents appear to be stored on a physical tape device of a 
20 particular model, with the properties and behavior of that 
model emulated by the actions of the virtual tape storage 
system. However, the data may actually be stored as a virtual 
* volume on any of a variety of different storage mediums such 
as disk, tape, or other non-volatile storage media, or 
25 combinations of the above. The virtual volume may be spread 
out over multiple locations, and copies or "images" of the 
virtual volume may be stored on more than one kind of physical 
device, e.g., on tape and on disk. 

When an image of the virtual volume is stored on 
30 disk, different portions of the volume's contents may be 
stored on different disk drives and on different, non- 
contiguous areas of each of the disk drives. The virtual tape 
storage system maintains indexes which allow the contents of 
any virtual volume whose image is stored on disk to be read by 
35 the host, the virtual tape storage system retrieving scattered 
parts as needed to return them in correct sequence. 

When an image of a virtual volume is stored on tape, 
it may be stored on a single tape together with images of 
other virtual volumes, or different parts of the image may be 
40 stored oh more than one different tape with each part again 
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placed with images, or parts of images, of other virtual 
volumes . In both of these approaches to tape storage of 
virtual volume images, the images are said to be "stacked." 
The virtual volume images may be stored on a variety of 
different tape device models other than the one being 
emulated. As with images stored on disk, the virtual tape 
storage system maintains indexes which allow it to retrieve 
the contents of any virtual volume stored in a stacked image 
from the tape or tapes on which it is stored; 

A shortcoming of storing stacked images on tape 
arises because the stacked image is not recognizable by 
standard hardware and application programs. 

Existing virtual storage systems include proprietary 
tape drive units for destaging virtual volumes from staging 
disks to tape. If, as is usually the case, the customer has 
already invested in tape library hardware the addition of- a 
virtual tape drive system requires adding additional tape 
drive resources to perform destaging operations for the 
virtual tape drive system. 

Thus, an improved virtual tape system and methods 
for its operation that overcomes the shortcomings of the 
presently available devices is needed. 

SUMMARY OF THE INVENTION 

According to one aspect of the present invention, a 
virtual library manager (VLMAN) subroutine, part^ of a Library 
Management System (LMS) running on the host computer, 
interfaces the virtual storage system and the host computer. 
VLMAN interacts with software provided with the existing tape 
library to access physical tape volumes mounted on tape drives 
in tape library. 

According to another aspect of the invention, the 
contents of virtual volumes staged on staging disks on the 
virtual tape server may be destaged to the physical tapes 
mounted on the tape library to reclaim space in the virtual 
tape server. 
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Other features and advantages of the invention will 
be apparent in view of the following detailed description and 
appended drawings . 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. lA is a conceptual block diagram of a preferred 
embodiment of the invention; 

Fig. IB is a block diagram of a preferred embodiment 
of a tape drive emulating (TDE) system according to the 
present invention; 

Fig. 2a is a representation of a packet; 

Fig. 2b is a representation of packet contents for 

compressed user data; 

Fig. 2c is a representation of packet contents for 

uncompressed user data; 

Figs. 3a and b are flow charts of steps performed by 
an embodiment of the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

A preferred embodiment will now be described with 
reference to the figures, where like or similar elements are 
designated with the same reference numerals throughout the 

several views . 

Fig. lA is a high-level block diagram of a digital 
system in which a preferred embodiment of a virtual tape 
storage system of the present invention is utilized. In Fig. 
lA, a host computer 10, for example an IBM mainframe computer, 
executes a plurality of applications 12. In practice, host 
computer 10 typically runs the MVS operating system 
manufactured by IBM, although other operating systems are well 
known to one of skill in the art and may also be used. MVS 
provides I/O services to various applications 12 including I/O 
for a tape unit 20, which may be an automatic tape library 
(ATL) , or other type of tape storage device. Applications 12 
may be coupled directly to tape unit 2 0 through ESCON tape 
devices (ETD) 24 by means of a physical interface such as an 
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ESCON 3490 Magnetic Tape Subsystem Interface 22. MVS, the 
ESCON interface 22, and the host computer 10 are well-known in 
the art . 

Applications 12 may also be coupled to a virtual 
tape server 30, also referred to herein as an open system 
server (OSS) . OSS is manufactured by the assignee of the 
present invention. Virtual tape server 30 maintains virtual 
tape drives 32 (VTDs) , which emulate the physical ETDs like 
those at 24. More details of the VTDs 32 will, be presented 
below. The interface between an application 12 and a VTD 32 
is OSS Emulated Device interface 33, which in the preferred 
embodiment is an ESCON interface. 

A library management system (LMS) software module 34 
also resides on host 10 and provides services to MVS and 
virtual tape server 30. LMS 34 is responsible for management 
of the tape library environment and performs such tasks as 
fetching and loading cartridges into drives, returning 
unloaded cartridges to their home locations, etc. The 
interface between LMS 34 and virtual tape server 3 0 is the 
Library Manager Interface with paths 35a and 35b based on two 
different and distinct protocols. 

VTD 32 is a non-physical device that responds as if 
it were a physical device. In the currently described 
embodiment, the emulated physical device is an IBM-3490 tape 
drive, although other devices may also be emulated. VTD 32 
responds to commands issued on a channel in the same fashion 
as the emulated technology. Thus, the absence of a physical 
tape device may be unknown to application 12. 

Applications 12 typically store data in tape 
volumes. Tape volumes are well-known data structures. A 
"virtual volume" is a collection of data and metadata that, 
taken together, emulate a real tape volume. When "mounted" on 
a VTD, these virtual volumes are indistinguishable from real 
tape volumes by the host computer. In this context, "data" 
refers to data output by the host to be stored on tape and 
"metadata" refers to information generated by virtual tape 
server 30 which permits the emulation of real tape drives and 
volumes . 
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Fig. IB is a high level block diagram of a part of 
virtual tape server 30 utilizing an embodiment of the present 
invention that may be coupled to one or more host computers 10 
(Fig. lA) . Host computers 10 are typically large mainframe 
computers running an operating system such as MVS, and various 
application programs. 

A plurality of channel interfaces (CIFs) 42 are 
coupled to host I/O channels (not shown) to transfer data 
between host 10 and virtual tape server 30. • .. 

Each CIF 42 includes a host interface 44, an 
embedded server 46, a data formatter 4 8 for performing data 
compression and other functions, a buffer memory 50, an SBUS 
interface 52, and an internal bus 54. In the preferred 
embodiment, the embedded processor 46 is a model i960 
processor manufactured by Intel Corporation. 

A main controller 60 is coupled to CIFs 42 and 
includes a main processor 62, a main memory 64, an SBUS 
interface 66, and an internal bus 68. In the preferred 
embodiment, the main processor is a SPARC computer 
manufactured by Sun Microsystems, Incorporated. CIFs 42 and 
main controller 60 are coupled together by a system bus 70, 
which is an SBUS in the preferred embodiment. 

Virtual tape server 30 stores host data in virtual 
volumes mounted on VTDs 32. In one preferred embodiment, the 
data is originally stored on staging disks 80. Because 
virtual tape server 30 must interact with the host as if the 
data were actually stored on physical tape drives, a data 
structure called a virtual tape drive descriptor is maintained 
in main memory 64 for each VTD 32. The virtual tape drive 
descriptor contains information about the state of the 
associated VTD 32. Additional structures, including a virtual 
tape "volume" structure and other structures subordinate to 
it, register the locations at which data is physically stored, 
among other information. 

Subsequently, data may be transferred from staging 
disks 80 to one or more magnetic tape units.' As mentioned 
above, tape units 20 may be individual tape units, automatic 
tape libraries (ATLs) , or other tape storage systems. 
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However, the location and other properties of the data is 
still defined in terms of the virtual tape volume structures 
in memory and stored in a disk-based control data set. 

An example will help clarify the meaning of the 
terms. If application 12 intends to write data to tape, it 
requests that a tape be mounted on a tape drive. LMS 
intercepts the request and causes a virtual volume to be 
mounted on one of the VTDs 32. to receive the application 
output, which is delivered by the ordinary tap.e output 
programs of the MVS operating system. Blocks of data received 
by virtual tape server 30 are "packetized" , the packets are 
grouped together in clusters with a fixed maximum size, called 
"extents", and the extents are written to staging disks 80 in 
virtual tape server 30. The staging disk space is treated as 
collections, called regions, of fixed-size space units called 
extents. Thus, data stored or to be stored in an extent is 
transferred between the controller and the staging disks 
during staging disk read/write operations. 

Often the extents containing data from one virtual 
tape are scattered over several disk drives. All information 
about the packetization, such as packet grouping in extents 
and extent storage locations, required to reassemble the 
volume for later use by the host is metadata. Part of the 
metadata is stored with each extent and part is stored on non- 
volatile storage in virtual tape server 30, separate from the 
extent storage. 

Data transferred from a host to a tape drive is 
sequential. The packets are stored in an extent in order 
sequentially by block number. A system for serializing 
packets is disclosed in the commonly-assigned co-pending 

application entitled "Data Serialization", filed 

(Attorney docket #18121-4-1) . 

Formatting a data block under this method produces a 
"packet" 200 as shown in figure 2. Packet 200 has a header 
210 that includes, for example, a Packet-Id, user-data 220, 
and a trailer 230. Packet 200 is shown in more detail in 
Figures 2b and 2c. Packet 200, which may conform, for example 
to ANSI standards X3. 224-1994 and X3. 225-1994, contains a 
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version of the hosts data block, compressed or, optionally not 
compressed, and descriptive control information such as the 
sequential number of the block in the sequence of all blocks 
written to a virtual tape volume, the lengths of the block, 
before and after compression, flags signaling whether 
compression was used and which of allowable compression 
algorithms was used, and calculated "CRC" check characters 
useful for verifying that packet 200, when transmitted from 
one storage system component to another, survived without 
corruption. In other words, the parts of packet 200 make the 
formatted block substantially self -describing . 

In the present invention, data sets stored on 
virtual volumes are destaged from the staging disks to the 
existing tape library attached to the host. Accordingly, the 
user's existing resources are utilized and no redundant 
investment in additional tape drive libraries is require. In 
the preferred embodiment the data sets are stacked on tapes in 
an existing tape library coupled to the host computer and 
accessed by standard programs resident on the host. 

The LMS software module includes a virtual library 
manager (VL^4AN) submodule. VL^4AN includes hooks to the host's 
existing tape drive accessing methods. When data must be 
destaged from OSS, VLMAN requests access to the host's tape 
libraries. Read (for reading data from the OSS) or write (for 
writing it to tape) directives are then issued from VLMAN to 
the host which executes the utilizing existing software. 

A preferred embodiment of the invention will now be 
described with reference to the flow charts of Figs. 3a and 
3b. The virtual tape library process (VTL) runs several 
monitor routines in parallel. A Start Health Check Process 
monitors the system for subsystem degradation. If RAID usage 
is critical a space manage routine is started. The space 
manage routine is also started periodically, on fixed time 
intervals . 

The space manage routine Requests a RAID status 
report. In a preferred embodiment this report is generated by 
mounting an administrative volume as described in a commonly 
assigned copending application entitled "IMPROVED INTERFACES 
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FOR AN OPEN SYSTEMS SERVER PROVIDING TAPE DRIVE EMULATION" 

(Attny. Docket No. 18121-6-1, filed ). 

The active and unassigned space from each virtual 
volume pool is read and an SMF (system management facility) 
5 entry containing RAID status report and usage is created for 
subsequent reporting functions. The percentage of active 
space is compared with a user defined parameter to determine 
whether space reclamation is required. 

The steps for space reclamation are ,^et forth in 
10 Fig. 3b. First it is determined whether a previous space 

reclaim is still running. If not, the reclaim process gets 
the next virtual volume pool to process. A volume candidate 
list for reclamation is created and work is handed over to 
stacking programs of VLMAN. 
15 These stacking programs use the tape library 

programs already on the host, under control of VLMAN, to read 
data sets from virtual volumes in the OSS and write them to 
the physical tapes mounted on the physical tape drives in the 
tape library. 

20 While the above is a complete description of 

specific embodiments of the invention, various modifications, 
alternative constructions, and equivalents may be used. 
Therefore, the above description should not be taken as 
limiting the scope of the invention as defined by the claims. 
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WHAT IS CLAIMED IS: 

1. A virtual tape storage system coupled to a host 
computer, with the host computer coupled to a physical tape 
drive and having tape library software for accessing physical 
volumes mounted on the physical tape drive, and with the 
virtual tape storage system responding to commands from the 
host computer as an emulated tape unit, the emulated tape unit 
having an expected format for storing data, the virtual tape 
storage system comprising: 

staging disks for storing virtual volumes; 
a processor for organizing data in the virtual tape 
storage system according to the emulated format; 

a virtual library manager program, running on said 
host computer, for transferring data in virtual volumes to 
physical volumes mounted on the tape library, with the virtual 
library manager program utilizing accessing methods in the 
tape library software to read data from the virtual tape 
storage system and write data to the physical tape volumes 
mounted on the tape drives of tape library. 
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